Graph-Enhanced Policy Optimization in LLM Agent Training
arxiv.org·4d
Model optimizations in LLMs
Flag this post
Reasoning Models Sometimes Output Illegible Chains of Thought
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
From Narrative to Action: A Hierarchical LLM-Agent Framework for Human Mobility Generation
arxiv.org·5d
Model optimizations in LLMs
Flag this post
PORTool: Tool-Use LLM Training with Rewarded Tree
arxiv.org·4d
🧠Large Language Models (LLMs)
Flag this post
Simplifying Preference Elicitation in Local Energy Markets: Combinatorial Clock Exchange
arxiv.org·1d
🌐Distributed LLM Systems
Flag this post
Do Not Step Into the Same River Twice: Learning to Reason from Trial and Error
arxiv.org·4d
🧠Large Language Models (LLMs)
Flag this post
Independent Clinical Evaluation of General-Purpose LLM Responses to Signals of Suicide Risk
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Dialogue as Discovery: Navigating Human Intent Through Principled Inquiry
arxiv.org·1d
💬Prompt optimizations for LLM serving
Flag this post
Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning
arxiv.org·1d
🧠Large Language Models (LLMs)
Flag this post
Simulating and Experimenting with Social Media Mobilization Using LLM Agents
arxiv.org·4d
🧠Large Language Models (LLMs)
Flag this post